Modeling Market Mechanism with Minority Game 
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Using the Minority Game model we study a broad spectrum of problems of market mechanism. 
We study the role of different types of agents: producers, speculators as well as noise traders. The 
central issue here is the information flow : producers feed in the information whereas speculators 
make it away. How well each agent fares in the common game depends on the market conditions, as 
well as their sophistication. Sometimes there is much to gain with little effort, sometimes great effort 
virtually brings no more incremental gain. Market impact is shown to play also an important role, 
a strategy should be judged when it is actually used in play for its quality. Though the Minority 
Game is an extremely simplified market model, it allows to ask, analyze and answer many questions 
which arise in real markets. 



I. INTRODUCTION 

Recently it became possible to study markets of het- 
erogenous agents, in particular in the form of the so- 
called Minority Games (MG) QJ^] . Since long time prac- 
titioners of the market, as well as some economists have 
criticized the main-stream economics where a so-called 
representative agent plays the central role. Many promi- 
nent economists like Herbert Simon |3j , Richard Day, and 
Brian Arthur jij have been forceful proponents of the 
" bounded rationality" and " inductive thinking" . How- 
ever, though many people join in unison in their crit- 
icism of the main-stream, their alternative approaches 
and models do not command consensus yet. 

The MG is inspired by Arthur's "El FaroF model [|, 
which shows for the first time how the equilibrium can 
be reached using inductive thinking. Whereas El Farol 
model is about the equilibrium, our MG model is about 
fluctuations. In a sense MG gives us a powerful tool to 
study detailed pattern of fluctuations, the equilibrium 
point is trivial by design. It is the fluctuations that play 
the dominant role in economic activities, like the market 
mechanism. MG allows us to study in a precise manner 
how is the approach to equilibrium, how the agents try to 
outsmart each other, for their selfish gain, compete for 
the available marginal information (any deviation from 
the mid-point represents exploitable advantage). It is 
for this residual margin all the agents fight for, resem- 
bling the real markets. The importance to market mech- 
anism is primordial, as any practitioner can attest. Neo- 
classical economics would tell us that the MG, as in a 
competitive market, does not offer consistent gain, based 
on the Efficient Market Hypothesis (EMH). However, 
if some agents stop playing (i.e. choosing dynamically 
among the two sides) , they will give off information that 
the other more deligent dynamic agents make away. In 
the sense the equilibrium can be only dynamically main- 
tained, any relaxing would imply a relative disadvantage. 
Isn't that the same thing in real markets? 

Studying a model of market mechanism opens up many 



detailed questions, which practitioners have to face con- 
stantly but the main-stream economists do not have any 
clue to answer them. For instance, in a model like MG 
agents interact with each other through a common mar- 
ket, what information each agent brings in? What gain 
each agent takes out? How sophisticated should an agent 
be? What is realizable gain objective? What is the role 
of noise traders? How about insider trading (an agent 
processing privileged information about fellow agents)? 
What is the market impact of an otherwise clever strat- 
egy? The list is obviously endless. The point we want to 
make here is that with so little to start with, and with 
so many questions relevant to real markets one can hope 
for a qualitative answer. 

After two years since the MG's birth, during which 
much work has revealed its extremely rich structure 0, 
an analytical approach leading to its exact solution has 
been found . Unfortunately the main progress is still 
confined in the physics community. We hope that, with 
this paper, this will change: The aim is to convince peo- 
ple, including economists hopefully, that many concrete 
questions about market mechanisms can be asked and an- 
swered, in precise and analytical way, using the approach 
of refs. §,0]. In fact the MG can be used as a flexible 
platform and different handles can be added and manip- 
ulated almost at wish. To achieve our goal, the anaytic 
approach shall be supplemented by numerical simulations 
to confirm its validity. More technical parts and heavy 
calculations shall be dealt with in the appendices. 



II. MAIN RESULTS 

Here below we give a list of salient points of our paper: 

1. Diversification of ideas. If an agent has different 
alternative strategies, the latter may have better 
to be diversified, i.e. not too much correlated. We 
show the effects of diversification. 

2. Markets have two types of agents: producers and 
speculators. The former do not have alternative 
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strategies; the latter are represented by the normal 
agents of the standard MG. Producers provide in- 
formation into the market, upon which speculators 
feed. For the first time it is possible to demonstrate 
that producers and speculators need each other, 
they live in a symbiosis. However, benefits to each 
group are not equal, depending on the parameters. 

3. Agents are not obliged to play, if they do not see 
a possible gain. We generalize MG to let agents 
to have the option of not playing. In the presence 
of producers, markets appear to be attractive and 
more speculators are drawn into the fore. 

4. Noise traders. One may wonder if some traders 
decide to be pure noise traders, i.e. they use com- 
pletely random strategy, what is the "harm" done 
to other market participants (producers and specu- 
lators), as well as to themselves. In the information 
rich, they appear to increase volatility and in the 
herding-effect phase, they actually make the mar- 
ket perform better. 

5. Despite of the fact that agents start equally 
equipped, there are better and worse agents and 
the rank of the agents has an interesting non Gaus- 
sian "bar-code" structure. 

6. Sometimes it pays to increase the capacity of 
an agent's brain, say add one more unit in M. 
This will give enormous advantage to the better 
equipped agent in the crowded phase (or symmet- 
ric phase) where information on the range M is 
exhausted, whereas such a feature becomes a dis- 
advantage in the information rich phase. 

7. Does it pay to have more strategies as alternatives? 
In general yes. Here we calculate the relative ad- 
vantage by having more alternatives. We also show 
that, due to self-market impact, the imagined gain 
differs from the real gain, a fact known too well to 
market practitioners. Even each agent has many 
alternatives, they actually use only a small number 
of them. 

8. Some agents may get illegal information about oth- 
ers. Just like a stock broker who knows his clients' 
orders before execution. Hence he has privileged 
information and should be barred from trading. 
One agent who spies on fellow agents enjoys trading 
advantages. We measure how much is this effect, 
as the number of fellow agents whom you spy in- 
creases, how much would be your gain. 

III. FORMALISM AND REVIEW 

Our model of market consists of iV agents which, for 
simplicity, can take only one of two actions, such as "buy" 



and "sell" at each time step t. We represent this assuming 
that each agent i = l,...,N, at time t, can either do the 
action a,i(t) = +1 or the opposite action ai(t) = — 1. 
Given the actions of all agents, the gain of agent i is 
given by: 

JV 

g l {t) = -a t (t)A(t), where A(i) = ^ aj (<). (1) 

This equation models the basic structure of market in- 
teraction where each agent's payoffs are determined by 
the action taken and by a global quantity A(t), which 
is usually a price and it is determined by all of them. 
For the sake of simplicity, we assume here a linear de- 
pendence of gi(t) on A(t). Other choices, such as gi(t) = 
— cii(£)sign A(t) in refs. |j],|]j9|, can be taken without af- 
fecting qualitatively the results we shall discuss below. 
This interaction clearly rewards the minority of agents 
(those who took the action dj(t) = — signA(t)) who gain 
an amount |^4(i)| and punishes the majority by a loss 
— hence the name minority game (l). There are 

always more losers than winners and agents have no way 
of knowing what the majority will do before taking their 
actions. 

All agents have access to public information which is 
represented by an integer variable \i taking one of P val- 
ues. At time t information "takes the value" fi(t). We 
shall also call n(t) history since originally this informa- 
tion has been introduced as encoding the record of the 
past M — log 2 P signs of A(t) with M bits. It has how- 
ever been shown JTof that if /z(£) is randomly drawn in 
{ 1 , . . . , P} one recovers the same results (see also the dis- 
cussion in refs. §J^j). We shall henceforth consider this 
second, simpler case. When having access to some infor- 
mation, agents can behave differently for different values 
of n(t), eventually because of their personal beliefs on 
the impact that information fi(t) shall have on the mar- 
ket's outcome A(t). Strictly speaking A(t) only depends 
on what agents do, so fj,(t) has no direct impact on the 
market. However if agents behavior depends on fi(t) also 
A(t) shall depend on it, and we denote it by A^ l \t). 

How do agents choose actions under information /i(t)? 
If agents expect that /i(t) contains some information on 
the market, they will consider forecasting strategies which 
for each value of \i suggest which action a M shall be done. 
There are 2 P such strategies, and we assume, for the time 
being, that each agent just picks S such rules randomly 
(with replacement) from the set of all 2 P strategies. The 
action of agent i if she follows her s th strategy and the 
information is fi is denoted by a(fj. Therefore, if Si(t) 
is the choice made (in a way we shall specify below) by 
agent i at time t, her action becomes ai(t) — > i and 
correspondingly, her gain [Eq. ([!])] becomes 
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9i(t)=-<$),i Afl(t Ht), where A"W(i) 



N 
3 = 1 



(2) 



In this paper, we mainly focus on 5 = 2. This case 
contains all the richness of the model and allows a more 
transparent presentation. All the results discussed below 
can be extended to 5 > 2 along the lines of ref. pj. For 
5 = 2 we can adopt a notation where each agent controls 
a variable Si G {J.> f}> with the indentification f = +1 and 
1= —1. This is useful to distinguish strategies Sj from 
actions a, . It is convenient to introduce the variables 



u u 



(3) 



With these notations, the action taken by this agent in 
reaction to the history /i is: 



(4) 



so represents the part of i's strategies which is fixed, 
whereas £f is the variable part. We also define = 
J2i so that 



JV 



N 



A"(i) = £\ 



<(*) 



(5) 



Each agent updates the cumulated virtual payoffs of 
all her strategies according to 



U Sli (t + l) = U Sli (t)-A^(t) a#*> 



(6) 



The quantity Ui >s is a "reliability index" which quantifies 
the agent i's perception of the success of her s th strategy. 
Ui <s (t) is the virtual cumulated payoff that agent i would 
have received up to time t if she had always played strat- 
egy s (with others playing the strategies Sj(t') which they 
actually played at times t' < t). Virtual here means that 
this is not the real cumulated payoff but rather that per- 
ceived by agent i. These differ, as explained below and in 
ref. , because agents neglect their impact on the mar- 
ket (i.e. the fact that if they had indeed always played s 
the aggregate quantity A(t) would have been different). 

Inductive dynamics |Q,|lJ consists in assuming that 
agents trust and use their most reliable strategy, which 
are those with the largest virtual score: 



Si(t) = arg max Ui S (t). 

s€{U} ' 



(7) 



More generally one can consider a probabilistic choice 
rule - the so called Logit model [jllj - such that P(si(t) = 
s) cx exp\TU a>i (t)], (see Then Eq. (0) is recov- 

ered in the limit T — > oo . As in ref. [|l3l , we find it useful 



to introduce the variables Aj(t) 
dynamics reads 



U, 



U, 



Their 



(8) 



(9) 



A i (t + l) = A i (t)-A^(t)^ 
and Eq. (0) becomes 

Si(t) = signAj(i). 

A. Notations on averages 



We define the temporal average of a given time depen- 
dent quantity R(t) as 



(10) 



This quantity can be decomposed into conditional aver- 
ages on histories, that is 



T 

1 — >oo 1 * — rf 

t--l 



(11) 



Note that the factor P and the relation (5^),**) — ^/P 
imply that is a conditional average. More precisely 
it is the temporal average of the quantity R(t) subject to 
the condition that the actual history jj,(t) was[] fi. Finally, 
averages over the histories \x of a quantity i? M are defined 



as 



R 



i p 

-Yr». 

P <H 



(12) 



B. Quantities of interest 

With these notations, let us now discuss the main 
quantities which characterize the stationary state of the 
system. The main free parameter, as first observed in 
ref. Si, is 



P 

a= N 



(13) 



and we shall eventually consider the thermodynamic 
limit where N, P — > oo with a fixed. The first quantity 
of interest is 



a 2 = (A*) = f> 2 + 2 ]T (at) ( Si ) + ]T Zitj (* 



(14) 



1 This implies that the number of iterations must be propor- 
tional to P in any numerical simulation 
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This equals the total loss of agents 

a 2 , (15) 

i 

so it is a measure of global waste. It also quantifies the 
market's "volatility" i.e. the fluctuations of the quan- 
tity A(t), and is related to the average "distance" be- 
tween agents (see appendix |X]). Even though (A) = 0, 
by symmetry, it may happen that for a particular /it, the 
aggregate quantity A(t) is nonzero on average, i.e. that 
(A 1 *) 7^ 0. In order to quantify this asymmetry, we intro- 
duce the quantity 

JV 

i=l i,j 

(16) 

Note that the only difference with a 1 lies in the diagonal 
terms (i = j) of the last sum. Indeed we assume that 
(siSj) = (si){sj) for i ^ j, whereat]] (s 2 ) = 1 ^ (s,) 2 . 
Indeed, we can write 

N 

»=i 

If H > 0, the game is asymmetric: At least for some /j, 
one has that (A^) ^ 0. This implies that there is a best 
strategy a^ cst = —sign {A^) which in principle could give 
a positive gain | (A) \ — 1 Q. In economic terms we may 
say that the system is not arbitrage free, and that H is a 
measure of the perceived arbitrage opportunities present 
in the market. As a function of a = P/N the system 
displays a phase transition with symmetry breaking [jl3| : 
For a > a c the symmetry between the two signs of A(t) 
is broken. 

H plays a particular important role because in refs. 
HQ it has been shown that the inductive dynamics is 
equivalent to a dynamics which minimizes H in the dy- 
namical variables m, = (s,). Therefore the ground state 
properties of the Hamiltonian H yields the stationary 
state of the system. H is a spin glass Hamiltonian where 
fl^i are the local magnetic fields and the coupling be- 
tween two agents. These play the same role as quenched 



2 This amounts to say that the fluctuations in time of s; 
around its average (si) are uncorrelated with Sj — (sj). 

This assumption fails when crowd effects occur, i.e. in the 
symmetric phase, and our theory will accordingly fail to de- 
scribe these effects. 

3 Here the —1 comes from the fact that if the strategy is 
actually played A^ — * A 11 + a(* t and "in principle" means 
that A^ would also change as a result of the fact that other 
agents would also react to the best strategy agent. 



disorder in spin glasses. This system is of mean field 
type since interactions are infinite ranged. For this 
reason, the statistical mechanics approach to disordered 
systems jl4],[l5| via the replica method yields exact re- 
sults for these models (see appendix ^|). The behavior 
of each agent is completely determined by the difference 
of her cumulated payoffs A,;. For long times, Aj ~ Uji 
where 

Vi = (A(t + 1) - A t (t)) = -2(1)1" (18) 

If Vi agent i shall stick to only one strategy Sj = 
signup, whereas if m — she will sometimes use her | 
strategy and sometimes her J, one. This is quantified 
by m,i, and a global measure of the fluctuations in the 
strategic choices of agents is given by 

1 N 

i=l 

This quantity also emerges naturally from the replica ap- 
proach where it plays a key role. 

IV. SPECULATORS WITH DIVERSIFIED 
STRATEGIES 

In the standard MG, it is assumed that the agents 
draw all their strategics randomly, and independently 
One can argue that the agents can be less simple-minded 
so that they first draw a strategy, and then following 
their needs or what seems the best for them, draw the 
others strategies. For instance, if S = 2, an agent can 
believe that one strategy is enough and sticks to it (or 
takes two same strategies). Reversely, an agent might 
believe that it is better to have one strategy and the 
opposite one. More generally, we suppose that all the 
agents^] draw their second strategy according to 

P(a^ = a[) = c V/z. (20) 

The parameter c counts the average fraction of his- 
tories for which the agents' choices are biased, that 
is, the average correlation between their two strategies. 
The standard MG corresponds to the independent case 
c = 1/2, while having only one strategy is obtained with 
c = 1. The other very special case is c = : all agents 
have two opposite strategies, thus there is no asymme- 
try in the outcome. As a result, the game is always in 
the symmetric phase : as a is varied, no phase transi- 
tion occurs. Increasing c has two effects : on one hand 
it increases the bias of the outcome f2 M ~ VcN, on the 



4 This can be generalized to a c for each agent ; exact results 
also arise from the replica calculus 
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other hand it reduces the ability of the agents of being 
adaptative, since they learn something about the game 
only when £f ^ (see Eq (^) ) , which happens in average 
for (1 — c)P histories. The fact that the biases depend on 
c too implies that the second order phase transition also 
occurs when this parameter is varied. With the replica 
formalism (see appendix |^) , one gets the phase diagram 
of the MG with parameter c (see figure ^) . In the stan- 
dard MG, one varies a (dot-dashed vertical line). If one 
fixes a and changes c, the symmetry is also broken (any 
horizontal line). Note that if c = and a > 1, an in- 
finitesimal c breaks the symmetry of the game. 



a (th.) 
a t (exp.) 




FIG. 1. Phase diagram of the Minority Game with diver- 
sified strategies. The phase transition in the standard MG 
corresponds to the dash-dotted vertical line c = 1/2. The 
circle are numerical data 



V. SPECULATORS AND PRODUCERS 

Real markets are not zero sum games The fact 

that most participants are interested in playing is beyond 
doubt. In real markets the participants can be grossly di- 
vided into two groups: Speculators and Producers Jl6| . 
Producers can be characterized by those using the market 
for purposes other than speculation. They need market 
for hedging, financing, or any ordinary business. They 
thus pay less or no attention to "timing the market". 
Speculators, on the other hand, join the market with the 
aim of exploiting the marginal profit pockets. The two 
groups were shown to live in symbiosis Jl6[ ] : the former 
inject information into the market prices, and the lat- 
ter make a living carefully exploiting this information. 
One may wonder why do producers let themselves be 
taken advantage of. Our answer is that they have other, 
probably more profitable business in mind. To conduct 
their business, they need the market, and their expertises 
and talents in other areas give them still better games to 
play. Speculators, being less capable in other areas, or 
by choice, make do exploiting the "meager margin" left 
in the competitive market. 

In our MG, these general questions can be studied in 
detail. Producers will be limited in choice, their activ- 
ities outside the game are not represented. We define 



a speculator as an normal agent, and a producer as 
an agent limited to one strategy. Thus the latter have 
a fixed pattern in their market behavior and put a mea- 
surable amount of information into the market, which 
is exploited by the speculators. We take a population 
of N speculators and always define a = P/N. We add 
pN heterogeneous producers, so that p is the fraction of 
producers per speculator. The outcome is then 



A» 



spec 



^prod- 



(21) 



The bias induced by the producers adds to the one 
caused by the speculators, so that the total bias is of 
order yj(c + p/2)N. Therefore the phase transition can 
be obtained at fixed P by varying either N, c, or the 
number of producers. Let us begin with the last possi- 
bility. We fix c = 0, P = 2 s , N = 641 and plot the 
gains of the speculators and producers as a function of 
the number of producers (see figure |^) . In the symmetric 
phase, the speculators wash out all the available infor- 
mation, thus, by symmetry, the gain of the producers 
(squares) is zero. As the number of producers increases, 
the gain of the speculators (circles) stays negative but 
grows monotonically, while the gain of the producers re- 
mains zero as long as the symmetry of the outcome is not 
broken. When the number of producers reaches a critical 
value, the speculators are no more able to remove all the 
available information, therefore the (second order) phase 
transition occurs (dashed line). Beyond this point, the 
producers lose more and more, while some (frozen) spec- 
ulators gain more than zero in average (see VIIl). At 
one point, the gains of speculators and producers are the 
same. Finally, there are enough producers to make the 
gain of the speculators positive on average. 




Number of producers [P] 



FIG. 2. Gain of producers and speculators versus the num- 
ber of producers (in P unit); the number of speculators is 
fixed at N = 641 (c = 0, M = 8, S = 2, a = 0.4, average over 
200 realizations). The lines are theoretical predictions. 

As illustrated by figures || and |J, if the number of spec- 
ulators changes the behavior is qualitatively the inverse 
of the one of figure |[ The gain of producers increases as 
the number of producers grows; similarly, the gain of the 
speculators decreases when N increases for sufficiently 
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large N. If there are not enough producers, the game is 
always negative sum for the speculators, and their gain 
has a maximum (see figure 0). 



G 



prod 



1 




1 2 

Number of speculators [P] 

FIG. 3. Gain of producers and speculators versus the num- 
ber of speculators (in P unit) ; the number of producers is fixed 
at 64 (c = 0, M = 8, S = 2, average over 200 realizations). 
The lines lines are theoretical predictions. 




Number of speculators [P] 

FIG. 4. Gain of producers and speculators versus the num- 
ber of speculators (in P unit) ; the number of producers is fixed 
at 256 (c = 0, M = 6, S = 2, average over 200 realizations). 
The lines are theoretical predictions. 

We now expose exact analytical results concerning the 
gain of the two types of agents. They rely on the gen- 
eralization of the approach of refe. jHQ: the calculus is 
carried out in detail in appendix O. Let us introduce 
G spcc , the total gain of the speculators and G pro d, the 



one of the producers. From Eq ( |l5| ) 



G s 



G 



prod 



-a 2 . 



(22) 



The results depend on the ratio p between the number 
of producers, on the number of speculators and on c, the 
parameter introduced in the previous section. We obtain 



er^ _ c + p+ (1 - c)Q 
~N~ (1 + X) 2 



+ (l-c)(l-Q) 



(23) 



where \ is the magnetic susceptibility of the system, and 
Q is defined in section HI B. These two quantities depend 
on a and on (l+p)/(l— c) (see appendix^). The average 
gain per producer is 



pN 1 + x 

and the average gain per speculator is 



(24) 



N 



c + p+{l-c)Q 
(1 + X) 2 



(l-c)(l-Q) + 



'X 
(25) 

Figures ||, || and f| completely agree with analytical re- 
sults; note that the small deviations are finite size effects. 
The fact that the gains of producers and speculators only 
depend on the ration p and not on how many producers 
and speculators there are in the game explains why fig- 
ures | and | look very much like the inverse of figure 

I 

As it emerges for the replica calculus, the critical point 
a c only depends^] on (1 + p)/(l — c) (see figure ||), that 
is, on the distribution of the quenched disorder. Numer- 
ical data (circles) completely agree with our results. The 
vertical line corresponds to the standard MG (p = and 
c = 1/2). A more intuitive version of this phase diagram 
is shown in figure ^| for c = 0. 



■ a (th.) 
<\ (exp.) 



Asymmetric phase 



Symmetric phase 



(1+p)/(l-c) 

FIG. 5. Phase diagram q c [(1 + p)/(l - c)] 



Symmetric phase 
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Asymmetric phase 


- G« pec =0 



Number of producers [P] 
FIG. 6. Phase diagram, and zero sum gain for speculators 
with c = 



°This explain why evolutionary schemes that preserve the 
distribution of the quenched disorder have the same a c 
while others that involve Darwinism, shift a c [fun 



G 



The game becomes favorable, on average, for the spec- 
ulators when their average gain is greater than zero. Us- 
ing Eq (p5|), one can plot the curve of zero sum gain for 
the speculators (see figure ||). One can see that the num- 
ber of producers must be greater than 1.868 .. .P (this 
value depends on c) in order to make the game positive 
sum for the speculators; this is consistent with numerical 
simulations (figures || and ||). 

The main message of these results is that producers 
always benefit from the presence of speculators, and re- 
versely : both types of agents live in symbiosis. Indeed, 
the producers introduce systematic biases into the mar- 
ket, and without speculators, their losses would be pro- 
portional to theses biases. The speculators precisely try 
to remove this kind of bias, reducing also systematic fluc- 
tuations in the market, thus reducing the losses of the 
producers and their own losses. Moreover, the efforts of 
speculators yield a positive gain only if the number of 
producers is sufficiently large. In this respect the sym- 
metric phase, where producers do not loose and specu- 
lators loose a lot, is unrealistic: real speculators would 
rather withdraw from a market which is in this phase, 
thus increasing a, and recovering the asymmetric phase. 
This suggests that a grand-canonical MG is much more 
realistic []. Here we briefly present an over-simplified 
"grand-canonical" MG. An agent enters into the market 
only when she has a strategy with virtual points greater 
than zero. As a result, the game is always in the asym- 
metric phase, but almost at the transition point : the av- 
erage losses of the producers are always extremely small 
(see figure ^). When the number of producers increases, 
the a priori asymmetry of the outcome increases, and 
more and more agents actually play the game (see figure 
thus in this situation, the producers give incentives to 
play to the speculators. Accordingly, the average gains 
of the speculators is much higher in this grand-canonical 
MG than in the corresponding canonical MG. 
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Number of producers [P] 



See also [|p||I?J 



FIG. 7. Average gain per agent versus the number of pro- 
ducers (in P units) in the grand canonical MG (N = 107, 
M — 5, a — 0.3, S = 2, c = 1/2, average over 500 realiza- 
tions) 
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FIG. 8. Average number of speculators versus the num- 
ber of producers (in P units) in the grand canonical MG 
(N = 107, M = 5, a = 0.3, S = 2, c = 1/2, average over 
500 realizations, ) 



VI. SPECULATORS, PRODUCERS AND NOISE 
TRADERS 

The debate about what the noise traders do to a com- 
petitive market is not closed ^0|. In the economics 
literature a noise trader is not very precisely defined. 
Sometimes they are synonym with speculators. We de- 
fine noise traders in the following way: they choose 
their actions without any basis. Compared with specula- 
tors, who analyze carefully the market information, noise 
traders take action in a purely random way (see appendix 
). Noise traders may be speculators who base their ac- 
tion on astrology, on "fengshui" , or on some "random 
number generators" . Our present model allows us to eval- 
uate the influence of noise traders on the market. They 
increase the market volatility er 2 , as shown in Fig. ^ and 
in appendix Therefore, in principle, they do harm to 
themselves as well to other participants. Actually in the 
linear-payoff version that we consider, the average gain of 
speculators and producers is not much affected by noise 
traders, since (j4 no i SC ) = 0. However, it is easy to see that 
in the original version, where gi(t) = — ai(<)sign A(t), 
payoffs are reduced by the presence of noise traders (see 
appendix |5]) . 

Our numerical results of Fig. ^ also shows that deep in 
the symmetric phase, noise traders reduces the volatility 
per agent a 2 / (N + N no i so ) , when this becomes bigger than 
one. This is easy to understand assuming that the only 
effect of noise traders is to increase a 2 by a constant equal 
to A^oisc = r/N. Let o-q/N be the volatility per agent, 
without noise traders (77 = 0) and a 2 that with noise 
traders. The variation in the volatility per agent in the 
presence of noise traders is: 
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N{l + n) N 1 + 1/77 



(26) 



As illustrated by figure numerical simulations globally 
confirm these conclusions, but also show that the effects 
of the noise traders are more pronounced than the theory 
predicts. 
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FIG. 9. Normalized variance of the outcome with (opaque 
circles and without (black squares) noise traders; the dotted 
line is the naive theoretical prediction. Inset : difference of 
variances with and without noise traders (N — 101 specula- 
tors, 50 noise traders, average over 1000 realizations). 



VII. MARKET IMPACT 

In order to quantify the impact of an agent on the 
market let us first consider the case of an external agent 
with S strategies: This agent does not take part in the 
game but just observes it from the outside. From this 
position, each of her strategies gives an average^ virtual 
gain 



-a s {A), 



1, 



(27) 



Given that the strategies are drawn randomly, u s are 
independent random variables. Since u s is the sum of 
F > 1 independent variables a^(A^)/P : their distribu- 
tion is Gaussian with zero mean and variance 



Var(it s 



-^f>r(<)(^} 2 = |. 



Clearly, one of these strategies, that with u s * = max s u s , 
is superior to all others^. It would be most reasonable 
for this agent to just stick to this strategy. 

However, the same agent inside the game will typically 
use not only strategy s* . This is because every strategy, 



7 The average is meant over a long time here 
8 The distribution of it s » can be easily com puted using ex - 
treme statistics. For S S> 1 typically u s * ~ \/2H log (5) /P. 



when used, delivers a real gain which is reduced with re- 
spect to the virtual one by the "market impact" . Imag- 
ine the "experiment" of injecting the new agent in a MG. 



Then (A^) -> (A* 



where, in a first approximation, 



we neglect the reaction of other agents to the new-comer. 
Then the real gain of the newcomer is: 



.9s — - a s(A) - (a s a s ) 



1. 



(28) 



The agent will then update the scores U s (t) with the 
real gain g s for the strategy she uses and with the virtual 
one u s ' — g S ' + 1 ^ a s a s >, for the strategies she does not 
use (in the following, we neglect the term a s a s i) . There- 
fore inductive agents over-estimate the performance of 
the strategies they do not play. Then if strategy s is 
played with a frequency p s , the virtual score increases on 
average by 

SU S = U s (t + l)- U s {t) = Ps g s + (1 - Ps )(g s + 1) 
= 9s - Ps + 1 (29) 

at each time step (on average) . If the agent ends up play- 
ing only n out of her S strategies with some frequency 
p s > 0, it must be that the virtual score increases SU S 
are all equal for these strategies and the virtual scores 
of strategies not played is lower. More precisely let 
s = 1, . . . ,n label the strategies which are played and 
r = n + 1, . . . , S those which are not played. It must be 
that 

5U S = g s - p s + 1 = v s = 1, . . . , n (30) 
SU r = g r + 1 < v r = n+l,...,S. (31) 

These equations yield the number n of strategy that this 
agent will use. Normalization of p s in the first equation 
yields the average virtual gain v of the agent, which is 



1 ™ 



1 



Qs~ - + 
n * — ' n 

s—l 



(32) 



Using p s = g s + 1 — v, we can compute the real gain of 
the inductive agent g — ^2 s —iPs9s- 

Summarizing, we find that inductive agents mix their 
best strategy with less performing ones. This is a conse- 
quence of the fact that they neglect their impact on the 
market. 

So far we did not take into account the reaction of other 
agents to the new-comer. In order to quantify this effect, 
let us consider a MG in the asymmetric phase, and let us 
add a new agent with the best strategy a M = —sign (A 11 ). 
This gives us an idea of this effect in the extreme case 
and we expect that for a randomly drawn strategy the 
effect will be smaller. Neglecting the reaction of other 
agents, we find that the available information with the 
new-comer should be H' ~ H — 2|(A)| + 1. Figure [h] 
shows that the reaction of all agents is indeed negligible, 
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excepted near the critical point, where H is of the order 
of 1. 




FIG. 10. Ratio of real H' over approximated 
H' ~ H - 2\{A) \ + 1 versus a (N = 101, average over 100 
realizations) 



much the agents loose due to their bias, is on average 
negative, due to the impact this bias has on the market. 
The second term in Eq ( |36| ) is always negative, and repre- 
sents the losses due to the switching between strategies, 
which, as shown above, arises from the neglect of market 
impact. Since the probability distribution function of rrii 
is not Gaussian |l, this term gives rise to an non Gaus- 
sian distribution of gj for non frozen agents. The average 
gain of the r-th best agent is represented in figure [l2]. 

By contrast, the term rn\) disappears for a frozen 

agent because m\ = 1. It is replaced by |(^4)Cfc| which 
is always positive and which measures how well agent k 
exploits the available information. Therefore, in average, 
the frozen agents gain more than the non frozen ones. 
This is clearly illustrated in figure |ll| which also shows 
that Eqs. (|3^) and |37]) are exact. Finally, a producer 
is of course frozen, and her gain is always zero, since she 
has | | =0. 



VIII. GAIN 

In this section we show how the behavior and the gain 
of each agent (speculator as well as producer) depends 
on her microscopic constitution and on the asymmetry 
of the outcome A(t) in the asymmetric phase. Let us 
denote the gain of agent i by jj ; by definition, 



M=6, N=127, NIT.100000 



-(Aai 



(33) 



In the asymmetric phase, since the stationary state is 
mean field, (sjSj) = m^mj. Consequently, by expanding 
Eq. (B3) one obtains 



9i =-{A)t 



= -(A)^-(I)|-m i -i?(l-mf). (34) 
Remember that the stationary behavior of agent i is 



described by Vi = —2(A)£i (see section |Hl|) . If an agent is 
non frozen, m = 0, while m, = —signup otherwise, hence 
the gain of a generic agent i is 



g i = -{A)u,i + \(A)S i \-&(l 



(35) 



Note that the second term of above equation vanishes for 
a non frozen agent j and therefore 



9j 



-(A)ujj — £?(1 — mi) non frozen. 



(36) 



On the other hand, the third term of Eq. (|35|) vanishes 
if agent k is frozen: 



= -(A)oj k + \(A)£ k \ frozen. 



(37) 



In Eqs (36) and (p7\), the gain of each agent is expressed 
as her internal constitution, allowing us to interpret what 
does the gain of a general agent depends on. In both 
equations, the first term — {A)u>i, which represent how 




-0.5 0.5 1 

experimental gain 

FIG. 11. Theoretical gain versus experimental gain show- 
ing that the frozen agents gain more than the active ones 
(a = 0.5, M = 6) 



gain 

FIG. 12. Bar-code structure of the r-th best agent's gain 
(a — 10, M = 10, S = 2, average over 300 realizations) 



IX. PRIVILEGED AGENT OR 
INSIDER-TRADING 

In this section we consider a MG where a particular 
agent has different characteristics. In particular we ad- 
dress the question of what additional resources would 
be advantageous for this agent and in which circum- 
stances. In the first subsection, we consider an agent 
with S' strategies (with S' > S, where S is the number 
of strategies assigned to other agents) . The last two sub- 
sections are devoted to the study of effects of asymmetric 
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information, in which an agent has access to privileged 
information which the other cannot access. This can be 
achieved in several ways. First, we consider the case of 
a pure population with memory M and one agent with 
a longer memory M'. Then we consider the case of an 
agent who knows, in advance, how a subset of agents 
plays. 



A. An agent with S' strategies 

In the symmetric phase, no matter how many strate- 
gies an agent has, there is no possibility of gaining. 
Therefore we focus in this section on the asymmetric 
phase. 



As shown in Sect. VII, inductive agents over-estimate 
the performance of the strategies they do not play 

Let us consider now the case where an agent with S' 
strategies enters into a MG. As shown in Sect 



VII 



to 

a good approximation, the value of H/P is the only rel- 
evant information we need to retain of the stationary 
state of the MG without the special agent. This quan- 
tity encodes all other informations such as the number of 
producers, the number of strategies played by the agents 
in the MG and the value of a. 

We carried out numerical simulations, and compared 
it to the analytical results derived in Sect. VII. These 
are shown in the figures 13, for H/P = 0.5, and [l4|, 
for H/P = 1. The virtual gain v is always larger than 
the actual gain g. Even though g is less than the gain 
agents would get playing only their best strategy E[g s *] 
(maximal gain) , it is not much smaller and has the same 
leading behavior g oc Vln S. 

Numerical simulations agree well with analytical re- 
sults, apart from finite size effects which become more 
pronounced if H/P is smalQ 



Figures 



13 



and |TJ refer to values of H/P which are 
realistic of MG with producers. A moderately large S' 
suffices to obtain a positive gain g > 0. With S = 2 and 
without producers H/P ~ 0.1 at most. For these values 
the analytic approach suggests that, even playing only 
her best strategy an agent would need S' > 750 strate- 
gies to have a positive gain, whereas inductive agents 
would need more than S' ~ 2400 strategies to obtain a 
positive gain. The same agent would find that her virtual 
gain becomes positive with only S' > 8 strategies. These 
results for H/P = 0.1 suffer from strong finite size effects 
(which indeed are of the order of P/ H) . One would need 
system sizes N which are well beyond what our compu- 
tational resources allow to confirm these conclusions. 



...«•»* 



FIG. 13. Upper graph : average number of played strate- 
gies (circles) versus S' . Below : average virtual (diamonds) 
and actual (squares) gains versus S' for H/P = 0.5, from 
top to below (averages over 500 realizations). The lines are 
theoretical predictions 




FIG. 14. Upper graph : average number of played strate- 
gies (circles) versus S' . Below : average virtual (diamonds) 
and actual (squares) gains versus S' for H/P — 1, from top 
to below (averages over 500 realizations). The lines are theo- 
retical predictions. 

It is also interesting to observe that the number of 
strategies actually used by the inductive agent increases 
with S (sub-linearly) and it decreases as H/P increases 
(see figures |l3| and |l4|) . That means that if there is more 
exploitable information in the system, agent's behavior 
becomes more peaked on the best strategy. 



B. M' > M 

Let us consider the case of a pure population with 
memory M and one agent with a longer memory^j M' . 
Figure |l^ plots the gain of such an agent with M' = M+l 
as a function of a. The average gain of all agents is also 
shown for comparison. In the asymmetric phase the spe- 



This is m ostly due the term which we have neglected in the 
section VII : it is typically of the order of P/H 



In this kind of numerical simulations, one has to keep the 
dynamics of histories 
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cial agent receives a lower payoff, which can be under- 
stood by observing that she has a number ol histories 
pi _ 2*f _ 2p bigger than that of the pure population. 
Thus her effective a' = 2a is larger, which is detrimental 
in the asymmetric phase. 

The gain of the special agent is the same as that of 
normal agents at the point where there is neither persis- 
tence, nor anti-persistence (a ~ 0.25 for M = 3, and a c 
in the thermodynamic limit). 

By contrast, in the symmetric phase, the game is 
symmetric for normal agents but their anti-persistent 
behavior produces arbitrages who can be exploited by 
agents having a bigger memory. Indeed, as a decreases, 
the available information Hm> for the privileged agent 
growsQ. As a result the gain of the privileged agent be- 
comes larger than that of other agents and as a becomes 
small enough, it becomes positive. 



o 



AM=M'-M 

FIG. 16. Gain of an agent with M' = M + AM within 
a pure population with M — 3 (a = 0.1, average over 1000 
realizations 



C. Espionage 
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FIG. 15. Upper graph : normalized available information 
for M and M + l. Lower graph : Gain of an agent with M + l 
within a pure population with M = 3 (S — 2, average over 
3000 realizations) 

Can the anti-persistence be exploited even more if one 
increases M ' ? The figure [If] answers clearly no. This 
is not surprising since again the effective a is bigger and 
bigger as M' is increased. At the same time, the available 
information increases, but too slowly. 



Some agents may have access to some information 
about other agents. This is the case of a stock broker 
who knows his clients' orders before execution, hence he 
has privileged information and should be barred from 
trading. When there is no available information, as in the 
symmetric phase, an agent who has access to asymmetric 
information can expect at least to lose much less than the 
others agents, or even to have a positive gain. Also, since 
having access to a little information is greatly preferable 
to no information at all, only a very limited amount of 
information is needed to get a considerable advantage. 
Suppose that agent b knows the sign sg of the aggregate 
actions of a subset B of other agents. Let B = \B\ be the 
number of agents in B. Then sg(i) = sign X)ieB ai (^)' 
She can exploit this supplementary information by hav- 
ing two virtual values U^ s (t) and U^ s (t) for each of 
her strategies. In other words, if agent b knows that 
• s e(i) = +1 before having to choose, she takes her deci- 
sion according to the scores (t), that is, 



s b (t) = arg max (t) ; 



(38) 



H M i is defined as H — (A) 2 , but with an average over 
= 1,...,2P. 



she updates the scores of her strategies according to 

^!.(* + l) = C^,(*)-<? ) ^ t) (39) 

and analogously if sg (t) = — 1 . 

What is the kind of the supplementary information this 
agent has access to ? 

Since the outcome is anti-persistent in the symmetric 
phase and persistent in the asymmetric phase, only at the 
critical point there is no long term correlation in the out- 
come Jl3| . Accordingly, the spy always gain more than 
the average, except at the critical point where she gains 
the same (see figure |l7|). With this setting, the agent has 
access in particular to the anti-persistence of the sym- 
metric phase, explaining why even if only one agent is 
spied, the gain of the broker is much bigger (figure |l8|). 
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Finally, the comparison between the two types of asym- 
metric information we have considered shows that it is 
much more interesting to spy than to have a larger mem- 
ory : in the former case, one is sure to win more that the 
normal agents, except at the critical point. 



■ ■ All a 
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that the players are busy about. 

With the introduction of producers the game can be of 
positive sum. We have shown how producers and specu- 
lators live in a symbiosis: producers are passive players 
who do not try to switch strategies. The reason is that 
they volontierly give up the speculation opportunities be- 
cause they have outside business in mind. Thus produc- 
ers inject information that the eager speculators are just 
happy to feed on. The speculators, while making away 
profits, perform a social function by providing liquidity 
thus reducing producers' market impact. We believe this 
is also true in real markets. Numerous other results show 
that it is now possible to systematically study markets 
with heterogeneous agents, whith real questions in mind. 



FIG. 17. Gain of a spy and average gain of all agents ver- 
sus a (7V=101, Nb = 3, 100P iterations, average over 100 
realizations) 



APPENDIX A: GEOMETRIC AND ALGEBRAIC 
APPROACHES TO THE MG 

This appendix is devoted to giving intuitive but rigor- 
ous views of what happens in the MG. 




1 10 100 

FIG. 18. Gain of a spy versus the number of spied agents 
(N = 1001, a = 0.15, average over 1000 realizations) 

X. CONCLUSIONS 

In this work we have shown how to ask questions about 
real market mechanisms in a tay model. In spite of the 
severe simplification of the MG, with little modification 
one is able to study a broad spectrum of problems which 
could be dreamed of previously. The central result is 
to show that agents with limited rationality (or limited 
information processing power) can only make a market 
marginally efficient. To the first approximation one can 
say that these inductive players can maintain an approx- 
imate equilibrium, which is the central result of the El- 
Farol model. But studying carefully the fluctuations one 
finds that the fact that the market is more or less efficient 
does not imply that one can stop playing and sit at a ran- 
domly chosen site. Doing so would make the model less 
efficient. It is around this residual (marginal) inefficiency 



1. Geometric approach to the MG 

The global behavior of the MG, measured by a 2 , can 
be quite well understood with a geometrical approach. 
Indeed, it is directly related to a much more intuitive 
geometrical concept : the Hamming distance between 
agents ||, which is defined as follows for agents i and j: 

— — (a» — a.,) 2 1 1 , . . 

dij = 1 4 J ' = 2 - fnaj- (Al) 

It is worthwhile to note that 1 — dij equals the probabil- 
ity that both agents take the same action for a randomly 
drawn history, so for an agent, maximizing her distance 
with respect to all other agents is equivalent to maxi- 
mizing her gain. Since the game is dynamical, one has 
to consider the time average of the actual Hamming dis- 
tance between those two agents 



(di,j) = g - 2< a ^')' ( A2 ) 
The average Hamming distance per agent is then 

W = NW^T) 5^ = V 2A^bl) 5^ 

(A3) 

The relationship between the distance and the fluctua- 
tions arises naturally by rewriting the latter as 

a 2 =N + J2^a~)^ (A4) 
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that is, as a sum over random fluctuations and correla- 



tions. Putting Eqs (A3) and (A4) together, one finds 



£ = l-2(*-l)(i-<d> 



(A5) 



This equation^] links the geometrical and the 

analytical approaches |§|L3|. It states that finding the 
average Hamming distance between the agents is equiv- 
alent to determining a 2 by the analytical tools used in 
^,[7^13]]. In general, it is impossible to find the average 
distance with a geometrical approach due to the fact that 
the Hamming distance is not transitive^. However, in 
the so-called reduced space of strategies (RSS) ||, the 
distance is transitive, consequently Johnson et al. could 
find an approximate ana lytical expression of (d), and, by 
implicitly using Eq. ( |A5| ) (which is straightforward in the 
RSS), they also gave an approximative expression of a 2 
Tl|. An equation quite similar to Eq. (A5) also appears 




in |22], where it is shown that perceptrons playing the 
MG can cooperate. 



When a < a c , there are much more free variables (N 
indeed) than equations: the solutions of Eqs. ( |A^ ) then 
belong to a subspace of dimension N — P. This allows 
the anti-persistent behavior to take place, because the 
system is free to move on this subspace. In the special 
case c = 0, since there is no bias f2 M = 0. The linear 
system of equation is then homogeneous and the solution 
(si) = for all i always exists. In particular, if a > 1, 
this solution is unique, hence a 2 /N = 1. When a < 1, 
a subspace of solutions of dimension N — P arises, and 
the anti-persistent behavior also takes place. Note that 
in this case, the system is always in the symmetric phase, 
therefore there is no phase transition. 

This argument easily generalizes to S > 2 strategies [Q . 
If agents use, on average, n(S) strategies (and S — n(S) 
are never used) the number of free variables is Nn(S). 
There are P plus N equations which these have to sat- 
isfy, where the latter N comes from the normalisation 
condition on the frequency with which each strategy is 
used. At the critical point, these two numbers are equal, 
and we find 



2. Algebraic approach to the phase transition 

We expose the algebraic origin of the phase transition. 
As it has been recalled, the agents actually try to mini- 
mize the available information H , and can actually 
cancel it when a < a c . Let us see why. Since J? is a 
sum of P non negative averages (A) 2 , H = only if all 
averages are zero, namely (^4) = V/i, or equivalently 

N 

= -fi" (A6) 

4=1 

These are P linear equations in N variables. However 
the N variables m=(sj) are restricted to the [—1,1] in- 
terval. Above OL c there are Ncj> variables which are frozen 
at the boundary of this interval (m, = ±1)- Therefore 
there are (1 — 4>)N free variables only. As shown in refs. 

, the point a c marks the transition below which the 
system of equations ( |A6| ) becomes degenerate, i.e. when 
there are more variables than equations. Exactly at a c 
the number of free variables (1 — 4>)N exactly matches 
the number of equations P. Dividing this equation by N 
gives an equation for a c , 

a c = 1 - (f> (A7) 

which is indeed confirmed nuerically to a high accuracy. 



12 It is exact for any 5; even more, it remains exact if agents 
have not the same number of strategies. 
13 The knowledge of djj and d^k does not allow that of dj^. 



n c (S)=a c (S) + l. (A8) 

At the critical point nearly one half of the strategies yield 
positive virtual gain and are used, whereas the others are 
not used 0. From this we find 

a c (S) = ac(2) + I - 1 (A9) 

This shows that actually a c grows linearly with 5 1 , but 
in a slightly less simple way that the one previously be- 
lieved glll^). 

Let us now show how the behavior of the agents is 
related to persistence/anti-persistence. We define W as 
the average over the agents of v 2 : 

1 N 

W =N^ {V * )2 (A10) 

i=l 

T N 

= ^ T 2 E Y.&^mm (ah) 

t,t' = l i=l 

since the £f are independently drawn, 
1 N 

■ft E = (! - C )<W + OQ./VN). (A12) 
i=l 

Thus for large N 

1 T 

W~ Mm -Y,{A{t)A{t-r)\n{t)=n{t-r)) (A13) 



where (A(t)A(t — r)\fj,(t) = n(t')) means that the average 
is taken over time for all t and t such that /it = jttt- T , 
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and summed over all histories. A closely realated quan- 
tity was first studied in ref. ]l3| where it was shown to 
quantify anti-persistence in the symmetric phase. 

Note that this equation implies that there can be no 
frozen agents unless the outcome exhibits persistence (i.e. 
(A(t)A(t - T)\fi(t) =fi(t-r)) ^ 0), which agrees with 
the analysis of |0| . In this case we find 



(A14) 



On the other hand, no agent can freeze unless H is 
non zero, which is the same as saying that the outcome is 
persistent. Furthermore, the condition of freezing Vi ^ 
is equivalent to fl3|| 



^i^i ^ hi , 



(A15) 



where hi = fl^i + J2j^i ( s j) ■ ^ i s worthwhile to see 
that is the internal hamming distance. Eqs ( |A13| ) 
and ( A14 ) give global condition s wh ether there can be 
frozen players or not, while Eq. (A15) give conditions on 
individual freezing. 



APPENDIX B: THE MG IN BIOLOGY 

The MG model has another important application in 
biology: the sex ratio of 50:50. In the widely read book of 
Richard Dawkins " Selfish Gene" |Q , the Fischer theory 
was brilliantly explained: if in the offspring pool either 
males or females were in minority, reproductive strate- 
gies for giving birth to a member in that minority would 
enjoy a genetic advantage linearly proportional to the de- 
viation from the 50:50 ratio. The stable ratio is thus dy- 
namically maintained. Brian Arthur's "El Farol" model, 
is also of the same genre, to show that using alternative 
strategies can lead to equilibrium. MG goes one step fur- 
ther: while the equilibrium point is previously solved in 
different contexts by Fisher, Arthur et al, we concentrate 
on more refined questions. 



APPENDIX C: REPLICA METHOD FOR THE 

MG 

For the sake of generality, we consider three different 
population of agents: 

1. the first population is composed of N speculators. 

These are adaptive agents and they have each two 
speculative strategies { , i for i = 1, . . . , N and 
p = 1, . . . , P. These are drawn at random from the 
pool of all strategies, independently for each agent. 
We allow a correlation among the two strategies of 
the same agent: 



P ( a h a l) = 2 [ S a r ,+l$ai,+l + £a T ,-l<*ai,-x] 

1 — c 

I (ci) 

Note that, for c = agents choose just one strategy 
a-\ and fix aj = —a-\ as its opposite, whereas for 
c = 1 they have one and the same strategy a-\ = a|. 
The original random case corresponds to c = 
1/2. These agents assign scores U s ^(t) to each of 
their strategies and play the strategy Si (t) with the 
highest score, as discussed in the text. Therefore 
for speculators: 



t(t),i- 



(C2) 



2. then we consider Ap"^j P = pN producers: They 
have only one randomly and independently drawn 
strategy 6f so 



-^prod 



(t)=b 



(C3) 



Producers have a predictable behavior in the mar- 
ket and they are not adaptive. Instead of pN in- 
dependent producers one can also consider N 
correlated producers who all have the same pre- 



dictable behavior 6p rod . 



3. Finally we consider nN noise traders. These are 
defined as agents whose actions are given by 



} (t) = randomsign. 



(C4) 



Each noise trader as a random number generator 
which is independent of each other agent. 



It has been shown |6|,[7| that the stationary state prop- 
erties of the MG are described by the ground state of H. 
Note that this approach fails however to reproduce the 
anti-persistent behavior which is at the origin of crowd 
effects in the symmetric phase. In our case 



A(t) — A spoc (t) + ^4prod(i) + A no [ EO (t) 



where 



N 



Ajpcc(i) — ^2 a sl(t),j 



and 



pN 



A piod (t) =Y,bf t] ^A 



prod 



(C5) 



(C6) 



(C7) 



and A no i sc (t) = 2k(t) — nN where k(t) is a binomial ran- 
dom variable with P(k) = ( , '^)2-'' Jv . Since H = (A) 2 
and the contribution of noise traders to (A^) vanishes 
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(^4noise) = 0, the collective behavior of the system is in- 
dependent of r\. Noise traders shall contribute a constant 
T\N to a 2 and will not affect other agents. This only holds 
in the asymmetric phase (see text). We can then reduce 
to the study of speculators and producers only. 



Let us define, for convenience, A^ = Ag pec 
where 



N 

= V 

spec / j 



t,*~ 



(C8) 



and Ap rod is given in Eq. (C7). Here Si is the dynamical 
variable controlled by speculator i. We shall implicitly 
consider directly time averaged quantities so Sj is a real 
variable in [— 1, 1] rather than a discrete one. The pa- 
rameter A is inserted so that, once we have computed 
the energy H = (yl spcc + Ayl pro d) 2 we can compute the 
total gain G pro d of producers by 



-AA T 



od = 



ldH 
2~d\ 



A=l 



The gain of speculators is obtained subtracting this con- 
tribution and that of noise traders from the total gain 



G sp ec = -a 2 + V N-G 



prod • 



(C9) 



1. Replica calculation 

The zero temperature behavior of the Hamiltonian H 
can be studied with spin glass techniques |^5|,0. We 
introduce n replicas of the system, each with dynamical 
variables Si jC , labeled by replica indices c,d = l,...,n. 
Then we write replicated partition function: 



(Z«(/?)}=Tr s ]>-- (A!f)2 > C 



(CIO) 



where the average is over the disorder variables a^ i; 



and Tr s is the trace on the variables Si >c for all i and 
c. Following standard procedures G^,E3, we introduce 
a Gaussian variable so that we can linearize the ex- 
ponent in Eq. ( C10| ). This allows us to carry out the 
averages over a's and b's explicitly. Then we introduce 
new variables Q c ,d and r Cj( j with the identity 

1 = J dQ c ^ d 5 (^Qc,d - -jy Si^s lt dj 
oc / dr c , d dQ c , d e-^ r ^( NQ ^-^ Si ^) 



for all c > d, which allow us to write the partition func- 
tion (to leading order in TV) as: 



(Z n (/3)) = / dQdfe 



with 

F(Q,f) 



-Nn@F(Q,f) 



— TrlogT+^-E^ 



1 

n/3 



log 



Tr 



(Cll) 



T a ,b = $a,b H [C + p + (1 - c)Q a ,b] 



The matrix T is given by 

w 

a 

For correlated producers we would have obtained the 
same result but with p — > p + p 2 Ne 2 , where e measures 
the bias of producers towards a particular action for a 
given p,, or equivalently the correlation between the ac- 
tions of two distinct producers. More precisely e 2 is the 
average of 6f bj for i ^ j and for all fi. Therefore the 
limit p — > cxd also corresponds to a small share of pro- 
ducers p <C 1 with a small bias e ^ 0. Note that a bias 
e ~ \/N corresponds indeed to ~ N independent pro- 
ducers. Equivalently ~ \/N correlated producers, with e 
finite are equivalent to ~ N independent producers. 
With the replica symmetric ansatz 

Q c ,d = q + (Q - q)S c ,d, r c>d = 2r+(R- 2r)5 c>d 

the matrix T has n — 1 degenerated eigenvalues Ao = 
■y _l_ 2(i-c)/3(i-g) an( ^ Qne e jg enva i ue equal to \i — 



2/3[c+p+(l-c)q] 
a 

dard algebra, 



n 



1 



2(l-c)/3(l-g) 



therefore, after stan- 



F (RS \q, r) = ^lcg 



2(1 - c)f3(Q - q) 



+ 



a[c + p + (1 — c)q] 
a + 2{l-c)P{Q-q) 



^(RQ-rq) 



1 r 1 

-(log / dse- 13 ^) (C12) 
P J-i 



where we found it convenient to define the "potential" 
a0(R-r) 2 



V z (s) 



ar z s 



(C13) 



so that the last term of F^ RS ^ looks like the free energy 
of a particle in the interval [—1,1] with potential V z (s) 
where z plays the role of disorder. 

The saddle point equations are given by: 
dF (RS) 



dq 
q F (RS) 

dQ 
q F (RS) 

dR 
g F (RS) 

dr 



= 



= 



= 



= 



4(l-c)[c + p+(l-c)«] 



[a + 2(1 
j3{R -r) = - 



c)/3(Q - q)] 2 
2(1 -c) 



a + 2(1 -c)f3{Q-q) 



Q = «s 2 )) 
m -q) = 



(C14) 
(C15) 
(C16) 
(C17) 
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where ((•)) stands for a thermal average over the above 
mentioned one particle system. 

In the limit — ► we can look for a solution with 
q — > Q and r — > i?. It is convenient to define 



2(1 - c)/?(Q - g) 
X = : , and C 



a 



~(3(R-r) 
r 



(C18) 



and to require that they stay finite in the limit — > oo. 
The averages are easily evaluated since, in this case, they 
are dominated by the minimum of the potential V z {s) = 
y/ar((s 2 /2 — zs) for s G [—1,1]. The minimum is at 
s = — 1 for z < — C and at s = +1 for z > £. For 
— £ < z < £ the minimum is at s = z/C,. With this we 
find 



and 

«* 2 » = 



2V^ 

7T 



(C19) 



= 1-1 - 



CV In/2 



(C20) 



With some more algebra, one easily finds: 



X 



a/erf 



V2 



1 



(C21) 



Finally £ is fixed as a function of a by the equation 
^ e'^l 2 /_ 1\ „ / C A a 1 + p 



c 



n /(\ a 1 



(C22) 



Note that £ only depends on the combination (l+p)/(l — 
c) which runs from 1 - for p = c = i.e. no producers 
and "perfect" speculators - to oo. The latter limit occurs 
either if c — > 1, i.e. when speculators become producers, 
or if p —+ oo (many producers). 

Eq. ( |C21 ) means that \ diverges when a — > a c (p, c) + , 
which then implies that at the critical point 



erf 



V2 



(C23) 



This back in the other saddle point equations, yields the 
following equation for £ = Q c : 



p 



(C24) 



The free energy, at the saddle point, for f3 — > oo, is 

^ - •""(UxT" (C25) 



where Q and x take their saddle point values Eqs. (C2C) 
and ( |C2j] ) 



The gain of producers, from Eq. (C12), is 

Gprod _ P 



A r 



1 + X 



(C26) 



and that of speculators is obtained from Eq. 

At a c x — * 00 so that F( RS ^> -> 0. Note that the loss 
of producers vanishes L pro d -> as a -» a c , whereas the 
loss of speculators L spcc = (1 — Q)/2 is always positive 
below a c . 

The phase diagram is shown in figure |5|. here we discuss 
some limits. 



APPENDIX D: THE SIGN MG 

The original MG (jj is defined with payoffs 

g i = -a i (t)signA(t) (Dl) 

Over a long period of time T, the change in Aj (t) is given 
by 

Aj(i + T) — Aj(t) 1^' (r) . . 



T 

p 



~ I £ [2Prob{^(r) > 0|/x(r) = /i} - 1] £f (D2) 

Then, for any fixed /z, the relevant quantity is the prob- 
ability that A(t) > 0, when fi(t) = p. This can be com- 
puted within our mean-field approximation: Indeed if 
(sj) = rrii we can regard Sj as a random variable with 
distribution 



P{Si = ±1) 



1 ± m, 



Then, the relation A^{t) = W+J2?=i €i s i implies that 
we can consider as a Gaussian variable with variance 



JV 



Var(^) = i 1 - ™i) + ^ 



This allows us to compute 

Prober) > 0\p(t) = a) = ^erfc ( - ==2 | . 
1 V ^ 2 ^ v /2Var(^l^) ^ 

Note that close to the critical point a c , (A^) is very small 
compared to Var(A M ), which means that it is legitimate 
to expand the erfc function to linear order. This gives us 
back a linear minority game, but with 



9i 



ai(t)A(t) 



(D3) 



1G 



Note, then that when r\ increases the gains for each spec- 
ulator decreases. This is actually true even away from 
a c . It is indeed easy to check that (sign A(t)) decreases 
as r\ increases. 
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